Replication Material for "The Measurement of Partisan Sorting for 180 Million Voters"

## 1 Information

Authors: Jacob R. Brown & Ryan D. Enos
Date: March 11, 2021
Contact email: jrbrown@g.harvard.edu, renos@gov.harvard.edu

### ### ### ###

# 2 File overview

## 2.1 Data files

us-voters-exposure-isolation-main.csv - This file contains the data set used for the main results: the measures of spatial and aspatial exposure to Democrats and Republicans for all voters in the data.

us-voters-variables.csv - This file contains geographic and demographic variables for all voters in the data.

us-voters-exposure-isolation-white-neighbors.csv - This file contains the measures of spatial and aspatial exposure to Democratic and Republican neighbors, calculated among each voter's white neighbors only.

us-voters-exposure-isolation-within-race.csv - This file contains the measures of spatial and aspatial exposure to Democratic and Republican neighbors calculated within each voter's non-white neighbors, broken out by race (Asian, Black, Hispanic). Only non-white voters are included in this file.

us-voters-exposure-isolation-square-rank.csv - This file contains the measures of spatial exposure to Democratic and Republican neighbors under two alternate versions of the measure: one where weights are constructed using squared distance, and one where weights are based on ranking neighbors by proximity.

us-voters-exposure-isolation-no-imps.csv - This file contains measures of spatial and aspatial exposure to registered Democratic and Republican neighbors. These versions of the measures do not use imputation to classify neighbors. Only voters from states that record partisanship are included in this file.

us-voters-exposure-isolation-discrete.csv - This file contains measures of spatial and aspatial exposure to registered Democratic and Republican neighbors, discretizing the posterior measures of partisanship when calculating exposure.

relative-exposure-aggregate-data.Rdata - This file contains relative exposure statistics at the CBSA, county, city, ZIP code, and Census tract level.

survey-comps.Rdata - This file contains summaries of descriptive statistics of the survey sample compared to the registered population.

sample-50k.csv - This file contains measures of spatial and aspatial Democratic and Republican exposure for a sample of voters, where exposure was calculated across a range of nearest neighbors, up to 50,000.

partisan-survey-analysis.csv - This file contains the survey responses used to validate the partisan imputation.

partisan-seg-county-summaries.csv - This file contains average summaries of Democratic and Republican exposure at the county level.

county-normal-vote-pres-2008-2016.csv - This file contains the presidential election results from 2008-2016 at the county level.
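The description of us-voters-exposure-isolation-square-rank.csv mentions weights built from squared distance and from proximity rank, but this README does not give the formulas. The sketch below is only one plausible reading, assuming exposure is a weighted mean of neighbors' partisanship with weights of 1/distance, 1/distance^2, or 1/rank. The function name `weighted_exposure` and the `scheme` labels are hypothetical and are not taken from the replication scripts.

```python
def weighted_exposure(partisanship, distances, scheme="inverse"):
    """Weighted mean of neighbors' partisanship values.

    ASSUMPTION: the weighting formulas here are illustrative guesses,
    not the definitions used in the replication code.
    Distances must be strictly positive.
    """
    if len(partisanship) != len(distances):
        raise ValueError("partisanship and distances must have equal length")

    if scheme == "inverse":
        # Weight each neighbor by 1 / distance.
        weights = [1.0 / d for d in distances]
    elif scheme == "squared":
        # Weight each neighbor by 1 / distance^2, so nearer neighbors dominate more.
        weights = [1.0 / d ** 2 for d in distances]
    elif scheme == "rank":
        # Weight each neighbor by 1 / rank, where rank 1 is the closest neighbor.
        order = sorted(range(len(distances)), key=lambda i: distances[i])
        ranks = [0] * len(distances)
        for rank, index in enumerate(order, start=1):
            ranks[index] = rank
        weights = [1.0 / r for r in ranks]
    else:
        raise ValueError(f"unknown scheme: {scheme}")

    total = sum(weights)
    return sum(w * p for w, p in zip(weights, partisanship)) / total
```

Under these assumed definitions, the squared-distance variant concentrates more weight on the very closest neighbors, while the rank variant ignores how far apart neighbors actually are and depends only on their ordering.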

## 2.3 Codebooks

We include codebooks for all data files in the repository. Codebooks are labelled 'Codebook_*.csv', where * is the name of the corresponding data set.
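As a minimal sketch of the naming convention, the helper below builds the expected codebook filename for a data file. It assumes that `*` stands for the data file's name without its extension; the README does not say whether the extension is included, so treat this as a guess and check it against the repository contents. The function name `codebook_name` is hypothetical.

```python
from pathlib import Path


def codebook_name(data_file: str) -> str:
    """Return the expected codebook filename for a data file.

    ASSUMPTION: '*' in 'Codebook_*.csv' is the data file name
    without its extension (e.g. without '.csv' or '.Rdata').
    """
    stem = Path(data_file).stem  # drop the final extension only
    return f"Codebook_{stem}.csv"
```

For example, under this assumption `codebook_name("us-voters-variables.csv")` gives `Codebook_us-voters-variables.csv`.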

# 3 Other information

## 3.1 Anonymization

We have removed all identifiable information from the voter data, including voter names, residential address, age, gender, and vote history. We have also replaced all geography variables except state with uninformative ids.

## 3.2 Tables and Figures without replication code

- We do not include code or data to replicate Figures 1, 2, and S4 or Tables 1-4.
- The data required to make these figures and tables contain identifiable information, so we omit them.

## 3.3 Memory Requirements

- Many of the scripts, particularly those that handle the nationwide voter data, require very large memory allocations to run. We recommend requesting 250-300 GB of memory to run these scripts.
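The memory recommendation above applies to the replication scripts themselves. For a quick exploratory look at one of the nationwide CSV files on a smaller machine, one option is to stream the file row by row instead of loading it whole. The sketch below uses only the Python standard library and is not part of the replication code; the function name `streaming_mean` is hypothetical, and treating empty strings and `NA` as missing values is an assumption about how the files encode missing data.

```python
import csv


def streaming_mean(lines, column):
    """Mean of one numeric column, reading one row at a time.

    `lines` is any iterable of CSV text lines, such as an open file.
    ASSUMPTION: missing values appear as empty strings or 'NA'.
    Returns None if the column has no usable values.
    """
    reader = csv.DictReader(lines)
    total = 0.0
    count = 0
    for row in reader:
        value = row.get(column)
        if value is None or value in ("", "NA"):
            continue  # skip missing entries
        total += float(value)
        count += 1
    return total / count if count else None


# Usage sketch (the column name is a placeholder; see the codebook for real names):
# with open("us-voters-exposure-isolation-main.csv", newline="") as f:
#     print(streaming_mean(f, "some_exposure_column"))
```

Because only one row is held in memory at a time, memory use stays small regardless of file size, at the cost of one full pass over the file per statistic.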
